NSF PAR Search | NSF Public Access Repository


Continual Learning Using Only Large Language Model Prompting

Qiu, Jiabao; Ke, Zixuan; Liu, Bing (January 2025, The 31st International Conference on Computational Linguistics (COLING-2025))

We introduce CLOB, a novel continual learning (CL) paradigm wherein a large language model (LLM) is regarded as a black box. Learning is done incrementally via only verbal prompting. CLOB does not fine-tune any part of the LLM or add any trainable parameters to it. It is particularly suitable for LLMs that are accessible via APIs. We also propose a new CL technique, called CIS, based on incremental summarization that also overcomes the LLM’s input length limit. Experiments show CIS outperforms baselines by a very large margin.
Free, publicly-accessible full text available January 19, 2026
In-Context Continual Learning Assisted by an External Continual Learner

Momeni, Saleh; Mazumder, Sahisnu; Ke, Zixuan; Liu, Bing (January 2025, The 31st International Conference on Computational Linguistics (COLING-2025))

Existing continual learning (CL) methods mainly rely on fine-tuning or adapting large language models (LLMs). They still suffer from catastrophic forgetting (CF). Little work has been done to exploit in-context learning (ICL) to leverage the extensive knowledge within LLMs for CL without updating any parameters. However, incrementally learning each new task in ICL necessitates adding training examples from each class of the task to the prompt, which hampers scalability as the prompt length increases. This issue not only leads to excessively long prompts that exceed the input token limit of the underlying LLM but also degrades the model’s performance due to the overextended context. To address this, we introduce InCA, a novel approach that integrates an external continual learner (ECL) with ICL to enable scalable CL without CF. The ECL is built incrementally to pre-select a small subset of likely classes for each test instance. By restricting the ICL prompt to only these selected classes, InCA prevents prompt lengths from becoming excessively long, while maintaining high performance. Experimental results demonstrate that InCA significantly outperforms existing CL baselines, achieving substantial performance gains.
Free, publicly-accessible full text available January 19, 2026
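
The pre-selection idea in the InCA abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `ClassMeanSelector`, the function `build_prompt`, the use of running class means with Euclidean distance, and the toy two-dimensional embeddings are all assumptions made for illustration. The sketch only shows how an incrementally built selector could narrow the classes that appear in an in-context prompt.

```python
import math


class ClassMeanSelector:
    """Hypothetical stand-in for an external continual learner (ECL).

    Keeps a running sum and count per class, so new classes can be added
    at any time without revisiting or changing earlier ones.
    """

    def __init__(self):
        self._sums = {}
        self._counts = {}

    def add_examples(self, label, embeddings):
        # Update the running statistics for one class.
        for vec in embeddings:
            if label not in self._sums:
                self._sums[label] = [0.0] * len(vec)
                self._counts[label] = 0
            self._sums[label] = [s + v for s, v in zip(self._sums[label], vec)]
            self._counts[label] += 1

    def top_k(self, embedding, k):
        # Rank classes by Euclidean distance to their mean (an assumed metric).
        def distance(label):
            mean = [s / self._counts[label] for s in self._sums[label]]
            return math.dist(embedding, mean)

        return sorted(self._sums, key=distance)[:k]


def build_prompt(text, candidate_classes):
    # Only the pre-selected classes appear in the prompt, so its length
    # depends on k rather than on the total number of classes learned.
    options = ", ".join(candidate_classes)
    return f"Classify the input into one of: {options}.\nInput: {text}\nLabel:"


selector = ClassMeanSelector()
selector.add_examples("billing", [[1.0, 0.0], [0.9, 0.1]])      # first task
selector.add_examples("shipping", [[0.0, 1.0], [0.1, 0.9]])
selector.add_examples("returns", [[-1.0, 0.0], [-0.9, -0.1]])   # later task

candidates = selector.top_k([0.8, 0.2], k=2)
prompt = build_prompt("I was charged twice", candidates)
```

In this toy setup the prompt lists only the two nearest classes. A real system would obtain embeddings from a model and would likely use a richer class representation than a plain mean.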
Open-world continual learning: Unifying novelty detection and continual learning

https://doi.org/10.1016/j.artint.2024.104237

Kim, Gyuhak; Xiao, Changnan; Konishi, Tatsuya; Ke, Zixuan; Liu, Bing (January 2025, Artificial Intelligence)

As AI agents are increasingly used in the real open world with unknowns or novelties, they need the ability to (1) (a) recognize objects that they have learned before and (b) detect items that they have never seen or learned, and (2) learn the new items incrementally to become more and more knowledgeable and powerful. (1) is called novelty detection or out-of-distribution (OOD) detection and (2) is called class incremental learning (CIL), which is a setting of continual learning (CL). In existing research, OOD detection and CIL are regarded as two completely different problems. This paper first provides a theoretical proof that good OOD detection for each task within the set of learned tasks (called closed-world OOD detection) is necessary for successful CIL. We show this by decomposing CIL into two sub-problems: within-task prediction (WP) and task-id prediction (TP), and proving that TP is correlated with closed-world OOD detection. The key theoretical result is that regardless of whether WP and OOD detection (or TP) are defined explicitly or implicitly by a CIL algorithm, good WP and good closed-world OOD detection are necessary and sufficient conditions for good CIL, which unifies novelty or OOD detection and continual learning (CIL, in particular). We call this traditional CIL the closed-world CIL as it does not detect future OOD data in the open world. The paper then proves that the theory can be generalized or extended to open-world CIL, the proposed open-world continual learning, which can perform CIL in the open world and detect future or open-world OOD data. Based on the theoretical results, new CIL methods are also designed, which outperform strong baselines in CIL accuracy and in continual OOD detection by a large margin.
Full Text Available
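
The decomposition of CIL into within-task prediction (WP) and task-id prediction (TP) described in the abstract above can be written as a chain-rule factorization. The notation here is assumed for illustration and does not come from this listing: \(X_k\) is the input domain of task \(k\), \(X_{k,j}\) is the \(j\)-th class of task \(k\), and \(D\) is the training data seen so far. This sketches the intuition only, not the paper's formal statement or proof.

```latex
\[
P(x \in X_{k,j} \mid D)
  = \underbrace{P(x \in X_{k,j} \mid x \in X_k,\, D)}_{\text{within-task prediction (WP)}}
    \cdot
    \underbrace{P(x \in X_k \mid D)}_{\text{task-id prediction (TP)}}
\]
```

The factorization holds because every class \(X_{k,j}\) lies inside its task domain \(X_k\), so a correct class prediction requires both picking the right task and picking the right class within it.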
Open-world continual learning: Unifying novelty detection and continual learning

Kim, Gyuhak; Xiao, Changnan; Konishi, Tatsuya; Ke, Zixuan; Liu, Bing (October 2024, Artificial Intelligence)

Full Text Available
Sub-network Discovery and Soft-masking for Continual Learning of Mixed Tasks

Ke, Zixuan; Liu, Bing; Xiong, Wenhan; Celikyilmaz, Asli; Li, Haoran (December 2023, ACL)

Full Text Available
Continual Pre-training of Language Models

Ke, Zixuan; Shao, Yijia; Lin, Haowei; Konishi, Tatsuya; Kim, Gyuhak; Liu, Bing (May 2023, Proceedings of The Eleventh International Conference on Learning Representations (ICLR-2023))

Full Text Available
A Theoretical Study on Solving Continual Learning

Kim, Gyuhak; Xiao, Changnan; Konishi, Tatsuya; Ke, Zixuan; Liu, Bing (November 2022, Proceedings of Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS-2022))

Full Text Available
Continual Training of Language Models for Few-Shot Learning

Ke, Zixuan; Lin, Haowei; Shao, Yijia; Xu, Hu; Shu, Lei; Liu, Bing (December 2022, Proceedings of The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP-2022))

Full Text Available
Adapting a Language Model While Preserving its General Knowledge

Ke, Zixuan; Shao, Yijia; Lin, Haowei; Xu, Hu; Shu, Lei; Liu, Bing (December 2022, Proceedings of The 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP-2022))

Full Text Available
CLASSIC: Continual and Contrastive Learning of Aspect Sentiment Classification Tasks

https://doi.org/10.18653/v1/2021.emnlp-main.550

Ke, Zixuan; Liu, Bing; Xu, Hu; Shu, Lei (November 2021, Proceedings of 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP-2021))

Full Text Available
